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Abstract 

Non  destructive  testing  methods  are  often  used  in  order  to  de¬ 
tect  and  classify  structural  flaws.  The  detection  of  structural 
flaws  is  useful  for  maintenance.  In  this  paper  we  propose  to 
classify  flaws  in  ferromagnetic  materials  by  measuring  Eddy 
currents.  Our  approach  consists  of  two  steps.  First,  we  use 
a  system  identification  algorithm  to  find  a  dynamical  system 
which  describes  the  data.  Then,  we  use  the  parameters  of  this 
dynamical  system  as  a  feature  vector  and  we  use  support  vec¬ 
tor  machines  in  order  to  classify  the  various  cracks.  We  test 
our  method  on  a  well-known  benchmark. 

1.  Introduction  and  motivation 

Non  destructive  testing  methods  are  used  for  checking  the 
presence  of  structural  flaws  (cracks,  deformations,  etc.)  of 
materials  without  causing  damage.  This  is  useful  for  pre¬ 
dictive  maintenance.  The  most  important  methods  for  non¬ 
destructive  detection  of  structural  flaws  are  the  following:  ul¬ 
trasonic  (Cantrell  &  Yost,  2001),  acoustic  emission  (Madaras, 
Prosser,  &  Gorman,  2005),  terahertz  ray  (Nemec,  Kuzel,  Garet, 
&  Duvillaret,  2004),  X-ray  (Elaqra,  Godin,  Peix,  R’Mili,  & 
Fantozzi,  2007),  thermal  (Clark,  McCann,  &  Forde,  2003), 
optical  method  (Jie,  Siwei,  Qingyong,  Hanqing,  &  Sheng- 
wei,  2009)  and  eddy  currents  (EC)  (Smid,  Docekal,  &  Kreidl, 
2005). 

In  this  paper,  we  propose  using  Eddy  currents  for  detecting 
flaws.  Methods  based  on  measuring  Eddy  currents  are  pop¬ 
ular,  because  measuring  Eddy  currents  is  cheap  and  it  allows 
detecting  clogged  defects  and  to  classify  cracks.  For  any 
classification  method,  feature  extraction  is  one  of  the  critical 
steps.  For  Eddy  currents,  several  feature  extraction  methods 
exist  in  the  literature.  (Jo  &  Lee,  2009;  Song  &  Shin,  2000; 
Liu  et  al.,  2013)  use  maximum  amplitude  and  phase  angle  or 
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width  of  defect  signal;  (Oukhellou,  Aknin,  &  Perrin,  1999; 
Smid  et  al.,  2005)  focus  on  the  Fourier  or  wavelets  transform 
parameters;  (Lingvall  &  Stepinski,  2000;  Ye  et  al.,  2009)  use 
principal  component  analysis  or  its  kernel  version. 

In  comparison  to  the  existing  methods  for  feature  extraction, 
the  main  novelty  of  the  proposed  method  lies  in  using  param¬ 
eters  of  dynamical  systems  as  feature.  This  represents  a  novel 
application  of  system  identification  techniques  to  fault  detec¬ 
tion  and  health  monitoring  of  ferromagnetic  materials  based 
on  Eddy  currents. 

Our  approach  is  based  on  two  steps.  First,  using  the  measured 
data,  we  find  a  parametric  dynamical  model.  This  model  rep¬ 
resents  current  impedance  values  of  eddy  currents  as  function 
of  past  impedance  values.  We  use  a  system  identification  al¬ 
gorithm  for  identifying  the  model  parameters  based  on  mea¬ 
sured  data.  Thus,  the  obtained  parameters  serve  as  feature. 
We  assume  that  each  flaw  corresponds  to  an  unique  parame¬ 
ter  vector.  We  then  use  support  vector  machines  to  compute 
a  classifier  on  the  extracted  parameter  space. 

The  experimental  evaluation  shows  that  our  approach  gives 
good  results. 

The  paper  is  organized  as  follows.  Section  2  is  devoted  to  the 
problem  statement.  Section  3  presents  the  data  pre-processing 
step  including  denoising  and  re-sampling.  A  new  method  of 
feature  extraction  based  on  dynamical  systems  identification 
is  explained  in  Section  4.  The  Support  Vector  Machines  op¬ 
erating  is  briefly  described  in  Section  5.  Section  6  shows  an 
example  of  classification  of  flaws  using  Eddy  currents  .  Some 
conclusions  are  drawn  in  Section  7. 

2.  Problem  definition 

Eddy  currents  are  used  in  many  applications  of  non  destruc¬ 
tive  testing.  When  a  conductive  material  is  within  a  time- 
variable  magnetic  field  created  by  a  coil  subjected  to  an  al¬ 
ternative  current,  induced  Eddy  currents  are  developed  inside 
the  material  without  altering  its  characteristics.  When  an  in- 
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homogeneity,  a  change  in  geometry  or  a  flaw  is  present  in  the 
material,  variations  in  the  phase  and  magnitude  of  these  eddy 
currents  can  be  monitored,  as  they  lead  to  a  change  of  the  coil 
impedance.  This  is  the  principle  of  material  inspection  by 
Eddy  currents.  Sensors  travel  across  the  surface  of  the  mate¬ 
rial  and  the  variations  of  the  coil  impedance  are  acquired  and 
compared  with  an  impedance  reference.  Then,  in  presence  of 
a  crack,  the  impedance  data  varies  as  function  of  sensor  or 
material  displacement,  following  a  trajectory  into  a  complex 
plane  where  abscissa  is  the  resistance  and  the  reactance  is  the 
ordinate. 

By  firstly  considering  aluminum  structures,  the  goal  of  the 
method  presented  in  this  paper  is  to  propose  to  automati¬ 
cally  classify  the  type  of  defaults  or  cracks  using  as  input  the 
impedance  data  trajectory.  To  present  the  method,  an  exist¬ 
ing  database  1  composed  of  Eddy  currents  signatures  from 
aluminum  aircraft  structures  has  been  used.  The  database 
is  composed  of  twelve  types  of  crack,  characterized  by  both 
penetration  angle  into  the  material  and  depth  of  penetration. 
Figure  1  shows  characteristics  of  all  the  twelve  cracks.  For 
example,  the  first  crack  type  is  defined  by  1.5 mm  of  depth 
and  90°  of  penetration  angle. 


different  impedance  trajectories  for  several  types  of  crack. 


Figure  2.  Impedance  trajectories  of  four  crack  types. 
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Figure  1.  Aluminum  sample  with  machined  notches  of  dif¬ 
ferent  penetration  angles  and  depths. 


For  each  crack,  acquired  impedance  data  are  complex  discrete 
time  series 

5(fc)  =  x(k)+jy(k), 

where  the  resistance  curve  x(k)  and  the  reactance  curve  y(k) 
are  known.  Each  type  of  crack  is  scanned  20  times  by  a  coil. 
This  leads  to  20  impedance  trajectories.  Figure  2  illustrates 

1  freely  available  on  the  website  :  http://wireless.feld.cvut.cz/diagnolab/node/16 


The  originality  of  this  paper  lies  in  classifying  crack  types  us¬ 
ing  the  parameters  of  temporal  models  that  fit  the  impedance 
trajectories.  Each  component  of  the  trajectory  (resistance  and 
reactance)  will  be  considered  as  time  series  ARX  model  where 
its  parameters  will  be  used  to  classify  the  crack  type.  So,  the 
main  steps  of  the  proposed  method  are  : 

•  from  each  inspection,  extract  sequences  x{k)  and  y(k), 

•  estimate  0X  and  9y  the  parameters  associated  to  the  time 
series  models, 

•  knowing  the  set  of  6X  and  0y,  corresponding  to  the  whole 
inspection  database,  build  the  classifiers. 

Before  explaining  in  detail  the  estimation  and  the  classifica¬ 
tion  steps,  it  is  worth  introducing  remarks  on  the  data  prepro¬ 
cessing  step. 

3.  Data  preprocessing 

First,  in  order  to  reduce  noise  impact,  resistance  x  and  reac¬ 
tance  y  are  filtered.  A  standard  median  filter  is  used.  For 
k  >  1,  the  median  values 

x(k)  of  {x(k  —  Lx  +  1),  •  •  •  ,  x{k  +  Lx  —  1))}  and 
y(k )  of  {y(k-Lx  +  1),  -  -  -  ,y(k  +  Lx  -  1))}  respectively 
replace  x{k)  and  y(k)  where  2 Lx  —  1  end  2 Ly  —  1  are  the 
prescribed  length  of  the  median  filter  window. 

Moreover,  collected  data  come  from  a  manual  scanning.  This 
implies  that  the  scan  speed  varies  over  time  and  variations 
impact  the  impedance  curve  shape  for  the  same  crack.  To  re¬ 
duce  the  effect  of  variate  scanning  speed,  data  {x{k)}i ii  and 
{v(k)}^f=  i  are  re-sampled  in  order  to  get  the  same  number 
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of  points  for  each  crack.These  filtered  and  re-sampled  data 
{x(k)}^=1  and  {y(k)}^=1  will  serve  as  the  second  learning 
dataset. 

Each  type  of  cracks  is  characterized  by  the  following  two 
properties:  “depth”  and  “angle”.  Depth  refers  to  the  depth  of 
the  crack,  and  angle  refers  to  the  angle  formed  by  the  crack 
and  the  horizontal  axis.  That  is,  each  crack  is  identified  with 
a  pair  of  numbers  (depth,  angle),  where  depth  denotes  the 
depth  of  the  crack  and  angle  denotes  the  angle  of  the  crack. 
Hence,  we  can  classify  cracks  as  follows.  First,  we  construct 
a  classifier  which  determines  the  angle  associated  with  each 
crack  based  on  measurement  data.  In  this  way,  we  obtain 
several  groups  of  cracks,  each  group  representing  cracks  with 
the  same  angle.  Second,  for  each  angle  a,  we  construct  a 
classifier  which  determines  the  depth  of  a  crack  whose  angle 
equals  a.  This  classifier  will  use  measurements  to  determine 
the  depth.  Note  that  the  second  classifier  is  supposed  to  dis¬ 
tinguish  only  cracks  with  the  same  angle.  Figure  3  shows 
curves  of  impedance  values  for  cracks  with  60°  af  angle  and 
1.5mm,  1  mm,  0.7 mm,  0.4mm  of  penetration  depths.  Each 
curve  is  a  finite  collection  {(x(k),  y(A:))}^L1  of  data  points, 
where  x(k)  is  the  real  part  and  y(k)  is  the  complex  part  of 
z(k),  the  filtered  and  re-sampled  impedance  value  measured 
at  time  step  k.  It  can  be  seen  from  the  data  that  the  shapes 
of  the  four  cracks  are  similar  but  some  are  larger  than  other. 
Therefore,  we  can  assume  that  the  depth  does  not  have  impact 
on  the  shape  of  the  curve  except  its  magnitude.  For  this  rea¬ 
son,  we  will  apply  a  normalization  step  in  order  to  find  crack 
angles.  That  is  to  say,  before  computing  the  first  classifier, 
we  will  divide  the  data  points  by  a  constant.  Thus,  the  first 
dataset  is  composed  of 
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Figure  3.  Cracks  with  60°  of  angle  and  four  penetration 
depths. 


of  penetration)  determines  an  unique  pair  of  parameters  vec¬ 
tors,  where  each  parameters  vector  corresponds  to  a  dynam¬ 
ical  system.  It  is  worth  noting  that  each  pair  of  parameters 
vectors  is  extracted  from  one  crack  observation.  Thus,  pa¬ 
rameters  vectors  are  independent  from  each  over.  The  first 
dynamical  system  models  the  resistance: 

x(k)  =  Ol<j>x(k)  +€x(k),  (1) 


{(*(*),  s/(*0)}kU  = 


(x(k),y(k)) 

max  \/ x2(k )  +  y2(k) 

fe£{l,-,JV} 


.  Note  that  for  the  computation  of  the  second  classifier,  we 
will  use  the  original  data  points  {(x(fc),  y(k))}^=1,  since  the 
crack’s  depth  influences  the  magnitude  of  the  data  points  and 
thus  normalization  could  lead  to  loss  of  information. 


4.  Feature  extraction  based  on  dynamical  sys¬ 
tems 

As  we  have  seen  before,  each  crack  observation  is  composed 
of  time  series  data  points  arising  from  Eddy  currents  measure¬ 
ments.  It  can  be  represented  as  curve  in  the  complex  plane, 
since  each  impedance  value  is  a  complex  number.  However, 
such  a  representation  discards  the  temporal  dependence  be¬ 
tween  various  data  points.  For  this  reason,  in  order  to  com¬ 
pute  classifiers,  we  will  use  the  time  series  {x(k)}^=1  and 
{y(k)}k Li-  We  will  use  these  time  series  to  compute  a  dy¬ 
namical  system  whose  input-output  behavior  is  consistent  with 
them.  We  assume  that  each  group  of  flaws  (i.e,  each  angle 


and  the  second  one  models  the  reactance: 

y{k)  =  0y<l>y{k)+£y(k),  (2) 

where 

<t>x(k)  =  [x(k  —  1),- •  •  ,x(k  -  nax),y(k  -  1),  •  •  •  ,y(k  -  nbx),l]T, 
<t>y(k)  =  \y(k- !)>'••  ,y(k~nay),x(k-  l),---  ,x(k-nby),l]  , 
K.riiJ  and  (nay,nby)  are  models  orders, {£x(k)}k>v 
{sy(k)  }/,>  |  are  independent  identically  distributed  random 
sequences  and  9X,  9y  are  the  associated  parameters  vectors. 

We  assume  that  for  each  crack,  the  pair  of  parameters  vectors 
(Ox  -  9 y )  determine  the  angle  of  the  crack.  More  precisely,  we 
assume  that  these  pairs  are  close  when  cracks  belong  to  the 
same  group,  i.e,  the  penetration  angle  is  the  same. 

In  this  paper,  we  will  use  the  Recursive  Least-Squares  method 
( abbreviated  by  RLS )  as  linear  system  identification  algorithm 
because  its  recursive  form  is  exact  comparatively  to  the  Feast 
Mean  Squares  (FMS)  and  then,  it  converges  more  quickly  to 
the  solution.  The  RLS  algorithm  was  proposed  to  determine 
the  parameter  9  of  the  equation 

w(k)  =  9Ti/j(k)  +  £(fc)  (3) 
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from  finitely  many  measurements  {w(k),ip(k)}k_1.  Note 
that  both  (1)  and  (2)  are  of  the  form  (3)  with  a  suitable  choice 
of  9{k),  ip(k),  £(fc)  and  w(k).  Below  we  describe  the  RLS  al¬ 
gorithm.  We  will  follow  the  presentation  of  (Ljung  &  Soder- 
strom,  1983).  Let  I  be  the  identity  matrix  and  let  P(0)  =  al 
be  the  initial  auto-correlation  matrix  of  data.  For  each  new 
observation  ( w(k ),  ip(k)),  the  update  of 
(oT (k  —  1  ),P(k  —  l)j  is  given  by: 


m 

p{k) 


w(k )  —  dT(k  —  1  )ip(k), 

Q{k_iy  ak)p(k-i)m 


a~1P(k  —  1) 


a  +  i p(k)TP(k  —  l)if)(k)  ’ 

ip{k)ip(k)TP(k  -  1) 


I- 


a  +  i/j(k)P{k  —  1  )Tip(k) 

(4) 


where  a  £  [0, 1]  is  the  forgetting  factor. 


5.  Classification  of  cracks 

The  goal  of  this  part  is  to  identify  the  class  membership  of 
each  crack.  The  classification  is  in  two  step.  The  first  one  is 
for  discriminating  the  penetration  angle  into  the  material  and 
the  second  one  is  for  selecting  the  penetration  depth.  Sev¬ 
eral  methods  exist  in  classification  theory.  For  labeled  data, 
the  Support  Vector  Machines  (SVM)  developed  in  (Vapnik, 
2000)  achieve  excellent  performance  according  to  (Caruana 
&  Niculescu-Mizil,  2006;  Khelil,  Boudraa,  Kechida,  &  Drai, 
2005).  The  following  lines  briefly  explain  SVM  principle. 


5.1.  Two  classes  SVM 


variables  e*,  >  0,  1  <  k  <  N  are  introduced  in  order  to  solve 
the  problem.  The  optimization  problem  becomes 


min 

w,b,ek 


such  that 


1 

2 


N 

( w ,  w)  +  C  E  e/c 

k= 1 

yk{wTxk  +  b)  >  1  -  ek 

£fc  >  0 


for  1  <  k  <  N, 


(7) 

where  C  is  a  positive  real  constant  for  determining  the  toler¬ 
ance  of  the  SVM  to  the  poorly  separated  observations.  The 

N 


solution  are  w  =  7 kykxk  and  b  =  bo  where  jk  >  0  for 


fc= l 

1  <  k  <  N  and  bo  are  obtained  by  solving  the  dual  formula¬ 
tion  of  (7).  Then,  the  decision  function  is 


N 


f(x)  =  sign  E  7 kVk  {xk,  x)  +  b0 


(8) 


^fc=i 


When  datasets  are  linearly  non-separable,  the  trick  is  to  project 
them  on  a  high-dimensional  feature  space  by  using  a  nonlin¬ 
ear  map  such  as  the  projections  are  linearly  separable. 
Thanks  to  the  Mercer’s  condition  (Mercer,  1909),  the  calcula¬ 
tion  of  the  dot  product  (ip(xk>),  ip(xk)}  which  often  requires 
a  lot  of  computational  resources  is  replaced  by  the  calculation 
of  the  kernel  K(xk>  ,xk).  Two  kernels  are  widely  used:  the 

/  lljj;  ,  _  £fc||2\ 

Gaussian  kernel  K(xk> ,  xk)  =  exp  I - k  —  2 - j  and 

the  homogenous  polynomial  kernel  K(xk' ,  xk)  =  (. xk > ,  xk)d, 
where  cr  an  d  are  tuning  parameters. 


The  SVM  has  been  firstly  used  to  separate  two  classes.  Its 
principle  is  to  maximize  the  separation  margin  ;  the  margin 
being  the  distance  between  the  closest  observations  and  the 
separator. 

Consider  a  given  training  set  {xk,yk}l<k<N  where  the  ob¬ 
servation  xk  £  K.  and  the  class  variable  yk  £  {  —  1,1}.  Sup¬ 
pose  that  data  are  linearly  separable,  i.e,  there  exists  a  linear 
classifier  (w,  b)  such  as 


In  this  paper,  these  parameters  are  selected  as  those  minimize 
the  leave-one-out  cross  validation  error  whose  the  procedure 
consists  of: 

•  splitting  the  data  set  of  size  k  into  k  smaller  subsets 

•  a  model  is  trained  using  k-1  subsets  as  training  data 

•  the  resulting  model  is  validated  on  the  remaining  subset 

•  the  previous  both  lines  are  repeated  k  times. 


wTxk  +  b  >  +1  if  yk  =  +1 
wTxk  +  b  <  -1  if  yk  =  -1 


(5) 


The  problem  of  finding  the  separator  which  maximizes  the 
margin  is  equivalent  to  : 


After  using  a  kernel,  the  decision  function  (8)  becomes 

f(x)  =  sign  {7  ':>■  UsJ'i  (Xk.x)  +  -  (9) 


■5!?  \l'w’w)  (« 

constraint  to  yk(wTxk  +  b)  >  1,  1  <  k  <  N, 

where  (, )  is  the  dot  product. 

Generally,  data  are  not  separable.  In  this  case,  the  margin  of 
some  observations  are  allowed  to  be  less  than  one.  Slacks 


5.2.  Multi-class  SVM 

The  mutli-class  SVM  is  an  extended  version  of  two  classes 
SVM.  Here,  it  is  supposed  that  the  number  of  classes  is  greater 
than  two. 

Here,  the  One  Against  One  SVM  is  used  for  classifying  more 
than  two  classes.  This  approach  is  very  intuitive.  It  consists 
of  making  several  classifiers  in  order  to  compare  pairs  classes. 
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Thus,  for  classifying  a  dataset  between  M  classes,  we  need  to 

make  -  — - -  separators  (Moreira  &  Mayoraz,  1998).  A 

majority  vote  across  the  classifiers  is  applied  to  classify  a  new 
observation. 

In  brief,  the  global  crack  classification  procedure  is  given  by 
algorithm  1. 

6.  Experimental  results 

In  order  to  test  the  reliability  of  our  method,  a  database  2  com¬ 
posed  of  Eddy  currents  signatures  from  aluminum  aircraft 
structures  is  used.  In  this  database,  there  are  twelve  types 
of  crack.  Each  type  of  crack  is  characterized  by  the  angle  and 
the  penetration  depth,  is  recorded  20  times.  Figure  1  shows 
characteristics  of  all  the  twelve  cracks. 

The  first  classification  task  is  devoted  to  the  penetration  an¬ 
gle.  Four  groups  are  created  from  the  twelve  types  of  crack. 
The  first  group  contains  80  cracks  with  90°  of  angle  and  1.5, 
1,  0.7,  0.4  mm  of  penetration  depth.  The  second  group  con¬ 
tains  80  cracks  with  60°  of  angle  and  1.5,  1,  0.7,  0.4  mm  of 
penetration  depth.  In  the  third  group,  there  are  20  cracks  with 
45°  of  angle  and  1.5  mm  of  penetration  depth.  The  last  group 
contains  60  cracks  with  30°  of  angle  and  1.5,  1,  0.7  mm  of 
penetration  depth. 

The  parameters  used  in  the  first  part  of  classification  algo¬ 
rithm  (see  algorithm  1)  are  the  following.  The  median  filters 
widows  size  are  Lx  =  30  observations  and  Ly  =  30  obser¬ 
vations.  The  fixed  number  of  observations  is  N  =  300.  The 
re-sampling  factor  is  factor  =  N/M  where  M  is  the  curve 
number  of  observations.  The  Recursive  Least  Squares  for¬ 
getting  factor  is  a  =  9.99  x  10_1,  its  value  for  the  initial 
auto-correlation  matrix  is  a  =  10  and  its  initial  parameters 
0s (0),  0y( 0)  are  randomly  selected. 

Each  crack  observation  is  composed  of  time  series  data  points 
arising  from  Eddy  currents  measurements.  For  discriminating 
different  cracks,  our  procedure  consists  to  extract  represen¬ 
tative  parameters  vector  from  each  crack  observations.  The 
extracted  parameters  vectors  are  independent  from  each  over. 
Several  values  of  order  are  tested  for  both  dynamical  systems 
identification  (1)  and  (2)  in  order  to  select  the  model  param¬ 
eters.  The  values  which  minimize  the  least  squares  errors  are 
(naiD,nbJ  =  (0,2)  and  ( nay,nby )  =  (0,2).  Figure  4  shows 
estimations  of  measured  resistance  and  reactance.  Both  esti¬ 
mated  curves  obtained  from  two  order  dynamical  systems  are 
close  to  real  curves. 


Algorithm  1  Procedure  of  cracks  classification 

1:  Have  Eddy  currents  measuring  data 

2:  Removal  of  edge  data:  a  set  of  uninformative  data  (null 
data)  to  the  right  of  each  curve  is  deleted 

3:  Data  filtering:  set  the  windows  sizes  Lx  and  Ly  of  me¬ 
dian  filters 

4:  Duplication  of  the  dataset  {(x(k),  y(k))}^L1  :  the  1st 
dataset  will  be  processed  for  discriminating  angle  and  the 
2nd  dataset  will  be  used  for  discriminating  depth  without 
further  processing 

5:  Re-sampling  of  resistance  and  reactance  curves  extracted 
from  the  1st  dataset  :  set  the  re-sampling  factor  factor 
in  order  to  obtain  the  same  number  of  observations  N  in 
each  curve;  {(x(k),  y(k))}^=1  is  the  obtained  dataset. 

6:  Normalization:  resistance  and  reactance  curves  extracted 
from  the  1st  dataset  are  divided  by  the  magnitude: 


(x(k),y{k)) 

max  ^Jx2(k)  +  y2{k) 
fce  { t  ,•  •  •  ,n} 


N 


fc= 1 


7:  Extraction  of  dynamical  system  parameters  vectors 


(0X.  6V)  from  {(x(k) ,  y(k))}^=1  and  the  Recursive  Least 
Squares  algorithm: 

•  set  the  forgetting  factor  a  £  [0, 1],  the  value  a  for 
the  initial  auto-correlation  matrix  of  data  P(0)  = 
al,  and  (0X(O),  9y{ 0))  the  initial  parameters  vectors 

•  for  i  e  {1,  •  •  •  ,  N}  do 

€x(k)  <-  x(k)  —  9x(k  —  1  )4>x{k), 

9(k)  ^  e  .(k-i)  +  k(*W~i)*«(fc) 

ot  +  f>x{k)T  P{k  —  l)(f>x{k)' 


P{k )  <- 

end  for 


P(k  -  1) 


7- 


Mk)MkVP(k-  1) 


a  +  Mk)P(k-l)TMk ) 


•  find  0y  as  in  the  above  lines 

8:  Using  a  multi-class  Support  Vector  Machines  for  de¬ 

termining  angle: 

•  set  0  the  concatenation  of  9X  and  9y 

•  the  dataset  of  9  is  separated  in  two  parts,  the  first 
part  will  be  used  for  training  and  the  second  part  for 
evaluation  algorithm  performance 

•  set  the  kernel,  the  kernel  parameter  and  the  positive 
constant  of  tolerance  C 


9:  Using  a  multi-class  Support  Vector  Machines  and  the 

2nd  dataset  {(x(k),y(k))}%L1  for  determining  the  depth 
among  depths  belonging  to  the  previously  found  angle 
group: 

•  calculation  of  the  magnitude  mg  : 


mg  =  max  \J  x2(k)  +  y2(k) 

fce{ l,---  ,m} 


It  is  worth  nothing  that  the  goal  of  our  extracted  parameters 
is  not  exactly  to  predict  the  real  curves  of  resistance  and  re¬ 
actance.  The  goal  of  prediction  is  to  evaluate  whether  the 
extracted  parameters  vector  explains  the  dynamic  of  the  con¬ 
sidered  time  series.  The  evaluation  step  allows  defining  the 

2Free  available  on  the  website  :  http://wireless.feld.cvut.cz/diagnolab/node/16 


•  the  subset  of  the  2nd  dataset  containing  magnitudes 
of  the  previously  found  angle  is  used  for  training 
and  the  second  part  of  data  used  in  step  8  is  used  for 
the  evaluation 

•  set  the  kernel,  the  kernel  parameter  and  the  positive 
constant  of  tolerance  C 
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quality  of  extracted  feature  so  that  this  one  could  be  used  to 
the  second  step.  The  assumption  is  that  whatever  the  type  of 
crack,  the  used  model  structure  is  fixed.  Then,  only  the  pa¬ 
rameter  of  the  identified  model  will  be  used  to  characterize 
the  type  of  crack. 


Figure  4.  Estimation  of  resistance  and  reactance  from  two 
order  dynamical  systems  model. 


For  each  crack  data  recorded,  the  pair  of  extracted  parameters 
vectors  (9Xl  9y)  belongs  to  R3  x  R3.  Figures  5  and  6  show 
these  parameters  in  two  three-dimensional  spaces.  It  can  be 
seen  that  the  four  groups  are  well  separated.  Before  classify¬ 
ing,  parameters  9X  and  9y  are  concatenated.  The  new  vector 
from  this  concatenation  9  belongs  to  R6. 

The  One  Against  One  SVM  with  a  Gaussian  kernel  is  used 
for  the  classification.  The  pair  (<r,  C)  of  the  kernel  param¬ 
eter  and  the  constant  of  tolerance  are  searched  into  a  grid 
[l0_5,104]  x  [l0_5,104]  containing  100  equidistant  pairs. 
The  selected  pair  (a  =  10,  C  =  10)  is  that  minimizes  the  miss- 
classification  error  rate  after  using  leave-one-out  cross  vali¬ 
dation.  The  minimum  of  the  miss-classification  error  rate  is 
equal  to  1.25%.  Tables  1  shows  the  confusion  matrix  of  the 
One  Against  One  SVM.  Thus,  we  can  say  that  our  approach 
has  a  good  classification  performance  with  respect  to  the  pen¬ 
etration  angle  into  material. 

In  order  to  evaluate  the  efficiency  of  our  classification  proce¬ 
dure,  we  are  going  to  realize  the  classification  with  extracted 
parameters  obtained  by  a  Principal  Component  Analysis  ap¬ 
plied  on  Fourier  descriptors  (FD-PCA).  The  FD-PCA  proce¬ 
dure  consists  firstly  to  extracted  Fourier  descriptors  from  our 
crack  observations.  Discrete  Fourier  descriptors  are  defined 


as 

1  N 

df(p )  =  —'^2z(k)exp(-j2np{k  -  1  )/N),  p=  1,  -  ••  ,N 

v  fc= l 

After  the  calculation  of  the  discrete  Fourier  descriptors  for 
each  crack  observation,  the  most  representative  descriptors 
are  selected  by  using  Principal  Component  Analysis.  Three 
complex  descriptors  are  chosen  and  these  ones  represent  93% 
of  the  total  variance.  By  using  One  Against  One  SVM  and 
the  leave-one-out  cross  validation,  the  miss-classification  er¬ 
ror  rate  is  equal  to  7.5%.  Tables  2  shows  the  confusion  matrix 
of  the  classification  based  on  the  FD-PCA  procedure. 

It  can  be  seen  that  our  extraction  procedure  based  on  dynam¬ 
ical  systems  combined  to  SVM  gives  better  results  than  the 
FD-PCA  procedure  combined  to  SVM  for  the  same  number 
of  extracted  parameters  (6  real  parameters  for  the  first  one 
and  3  complex  parameters  for  the  second  one).  Thus,  our 
classification  algorithm  is  efficiency  and  promising. 


Figure  5.  Illustration  of  the  parameter  9X  for  the  four  crack 
groups. 


Table  1.  Confusion  matrix  of  the  One  Against  One  SVM. 


Predicted  Ciroup 

Actual  Group 

Group  1 

Group  2 

Group  3 

Group  4 

Group  1 

80 

0 

0 

0 

Group  2 

0 

80 

0 

0 

Group  3 

0 

0 

19 

1 

Group  4 

0 

1 

1 

58 

The  second  step  of  classification  consists  of  discriminating 
the  penetration  depth  for  cracks  with  the  same  penetration 
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Figure  6.  Illustration  of  the  parameters  0y  for  the  four  crack 
groups. 


Table  2.  Confusion  matrix  of  the  One  Against  One  SVM  ap¬ 
plied  on  the  FD-PCA  feature  extraction. 


Predicted  Group 

Actual  Group 

Group  1 

Group  2 

Group  3 

Group  4 

Group  1 

77 

3 

0 

0 

Group  2 

8 

71 

1 

0 

Group  3 

0 

2 

17 

1 

Group  4 

0 

1 

2 

57 

angle.  To  do  this,  the  second  dataset,  i.e,  without  normaliza¬ 
tion  is  used.  Figures  7  shows  the  second  classification  step  in 
group  1 .  Similar  figures  are  obtained  for  groups  2  and  4.  For 
cracks  classified  in  group  3,  there  is  not  a  second  step  because 
the  penetration  depth  is  unique  (1.5mm).  According  to  this 
figure,  recorded  data  can  be  well  separated  by  hyperplanes. 
The  One  Against  One  SVM  with  a  Gaussian  kernel  is  used  for 
the  three  classifications  in  the  second  step.  As  we  previously 
mentioned,  the  pair  (cr,  C)  of  the  kernel  parameter  and  the 
constant  of  tolerance  are  searched  into  a  grid  [l0— 5 , 104]  x 
[l0_5,104]  containing  100  equidistant  pairs.  The  selected 
pairs  of  kernel  parameter  and  constant  of  tolerance  of  group  1, 
2  and  4  are  receptively  (cr  =  1,  C  =  1),  (a  =  0.1,  C  =  0.1) 
and  (cr  =  0.1,  C  =  0.1).  The  miss-classification  error  in  step 
two,  i.e  for  the  three  classifications  is  null.  In  other  words, 
according  to  available  data,  if  a  crack  is  classified  in  the  right 
group  (ie,  if  the  right  angle  is  selected),  the  right  depth  is  au¬ 
tomatically  selected.  Hence,  the  global  miss-classification  er¬ 
ror  is  equal  to  the  first  step  miss-classification  (approximately 
equal  to  1%). 


Remark. 

The  previous  classification  uses  Gaussian  kernel.  However, 
when  Gaussian  kernel  is  replaced  by  homogenous  polynomial 
kernel  respectively  with  parameters  (d  =  3,C  =  10~3),  the 
same  miss-classification  error  is  obtained.  Our  classification 
seems  robust  with  respect  to  the  selected  SVM  kernel. 


7.  Conclusion 

This  paper  addresses  the  problem  of  classifying  cracks  by  us¬ 
ing  measurements  of  Eddy  currents.  The  paper  proposes  a 
new  approach  for  cracks  classification  which  uses  dynamical 
systems.  The  parameters  of  these  dynamical  systems  form 
the  feature  space.  The  parameters  vectors  are  found  from  the 
measured  data  by  using  an  algorithm  from  systems  identifi¬ 
cation.  The  classification  is  done  in  two  steps.  The  first  one 
is  to  group  cracks  according  to  theirs  penetration  angles  into 
the  material.  The  second  one  is  to  group  them  according  to 
penetration  depths.  Our  method  is  evaluated  on  a  particularly 
challenging  benchmark,  for  which  the  cracks  are  more  dif¬ 
ficult  to  classify  due  to  the  variation  of  the  scanning  speed. 
After  using  multi-class  SVM  for  both  steps  classification,  the 
miss-classification  error  is  approximately  equal  to  1%.  This 
means  that  our  approach  is  efficiency  and  promising. 
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